policy evaluation
Country:
- North America > Canada (0.04)
- Asia > China (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Denmark (0.04)
- Asia > Middle East > Jordan (0.04)
Industry:
- Health & Medicine (0.67)
- Information Technology > Security & Privacy (0.45)
Country:
- North America > Canada > Alberta (0.14)
- North America > United States > Texas > Travis County > Austin (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Data Science > Data Mining (0.67)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)
Country:
- North America > Canada > Alberta (0.14)
- Asia > Middle East > Jordan (0.04)
- Europe > Hungary > Budapest > Budapest (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Country:
- Europe > Switzerland > Basel-City > Basel (0.04)
- Asia > Middle East > Republic of Türkiye (0.04)
- Asia > China (0.04)
Country:
- Europe > Switzerland > Basel-City > Basel (0.04)
- North America > United States > New Hampshire (0.04)
- North America > Canada > Quebec > Montreal (0.04)
Industry:
- Information Technology (0.87)
- Health & Medicine > Therapeutic Area > Immunology (0.46)
Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.86)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.51)
Solving Zero-Sum Markov Games with Continuous State via Spectral Dynamic Embedding
Chenhao Zhou
In this paper, we propose a provably efficient natural policy gradient algorithm, Spectral Dynamic Embedding Policy Optimization (SDEPO), for two-player zero-sum stochastic Markov games with a continuous state space and a finite action space. In the policy evaluation step of our algorithm, a novel kernel embedding method is used to construct a finite-dimensional linear approximation of the state-action value function.
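To make "finite-dimensional linear approximation" concrete, here is a minimal sketch that uses random Fourier features for a Gaussian kernel. This is a stand-in technique chosen for illustration and may differ from the paper's actual embedding. The names `make_rff`, `gaussian_kernel`, and `q_value`, the bandwidth, and the per-joint-action weight layout are all assumptions, not part of SDEPO as published.

```python
import math
import random


def make_rff(state_dim, num_features, bandwidth, seed=0):
    """Build a hypothetical feature map phi with
    phi(s) . phi(s') ~= exp(-||s - s'||^2 / (2 * bandwidth^2))."""
    rng = random.Random(seed)
    # Frequencies ~ N(0, 1/bandwidth^2) per coordinate; phases ~ U[0, 2*pi).
    omegas = [[rng.gauss(0.0, 1.0 / bandwidth) for _ in range(state_dim)]
              for _ in range(num_features)]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(num_features)]
    scale = math.sqrt(2.0 / num_features)

    def phi(state):
        return [scale * math.cos(sum(w * x for w, x in zip(omega, state)) + b)
                for omega, b in zip(omegas, phases)]

    return phi


def gaussian_kernel(s1, s2, bandwidth):
    """Exact Gaussian kernel, used only to check the approximation."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(s1, s2))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def q_value(phi, weights, state, joint_action):
    """Linear Q estimate: one weight vector per joint action (a, b)
    of the two players (an assumed parameterization)."""
    return sum(w * f for w, f in zip(weights[joint_action], phi(state)))


# Usage: compare the feature inner product with the exact kernel.
phi = make_rff(state_dim=2, num_features=2000, bandwidth=1.0, seed=0)
s1, s2 = [0.1, -0.3], [0.4, 0.2]
approx = sum(a * b for a, b in zip(phi(s1), phi(s2)))
exact = gaussian_kernel(s1, s2, bandwidth=1.0)
```

With features like these, the value function over a continuous state space reduces to a finite weight vector per joint action, which is the kind of object a least-squares policy evaluation step can fit.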
Country:
- Asia > Middle East > Jordan (0.04)
- Asia > China (0.04)
- Europe > Switzerland > Zürich > Zürich (0.04)
Genre:
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.67)
Country:
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Sweden > Stockholm > Stockholm (0.04)
- Europe > Portugal > Porto > Porto (0.04)
Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Data Science > Data Mining (0.92)
Country:
- North America > United States (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East > Jordan (0.04)